Compressed-Sampling-Based Image Saliency Detection in the Wavelet Domain

نویسندگان

چکیده مقاله:

When watching natural scenes, an overwhelming amount of information is delivered to the Human Visual System (HVS). The optic nerve is estimated to receive around 108 bits of information a second. This large amount of information can’t be processed right away through our neural system. Visual attention mechanism enables HVS to spend neural resources efficiently, only on the selected parts of the scene at order. This results in a better and faster perception of events. In order to perform saliency measurement on visual data, subjective eye-tracking experiments may be carried out. These experiments involve using devices to track eye movements of a number of subjects while they watch images or videos on a screen. That being said, such devices are not very suitable in practice due to hardship involved with carrying out experiments, such as need to have restricted test environment, being time consuming as well as expensive. Instead, researchers developed Computational Visual Attention Models (VAMs) in attempts to mimic the HVS saliency prediction process. Visual Attention Modelling has widely been used in various areas of image processing and understanding. Computational models of visual attention aim to predict the most interesting areas of an image to the observers. To this end, these models produce saliency maps, in which each pixel is assigned a likelihood value of being looked at. In other words, saliency maps highlight where the most likely for viewers to look at in an image is. Knowing the Regions of Interests (ROIs) can be helpful in applications such as image and video compression, object recognition and detection, visual search, retargeting, retrieval, image matching, and segmentation. Saliency prediction is generally done in a bottom-up, top-down, or hybrid fashion. Bottom-up approaches exploit low-level attributes such as brightness, color, edges, texture, etc. Top-down approaches focus on context-dependent information from the scene such as appearance of humans, animals, text, etc. Hybrid methods combine the two streams. This paper proposes a new method of saliency prediction using sparse wavelet coefficients selected from low-level bottom-up saliency features. Wavelet based image methods are used widely in image processing algorithms as they are especially powerful in decomposing images into several scales of resolutions. In our method, first random compressive sampling is performed on wavelet coefficients in the Lab color space. Random sampling enables a reduction in computational complexity and provides a sparse representation of the coefficients. The number of decomposition levels is chosen based on the information diffusion property of the signal. In the proposed method, the sampling can be done at a rate different than the Nyquist rate, and based on the sparsity degree of the signal. It is shown that having the basis vectors of a sparse representation of the signal, can result in an accurate signal reconstruction. In this work, the sparsity degree and thus the sampling rate is computed empirically. Next, local and global saliency maps are generated from these random samples to account for small-scale and large-scale (scene-wide) saliency attributes. These maps are then combined to form an overall saliency map. The overall saliency map therefore includes both local, and global saliency attributes. The main contribution of this paper is the use of compressive sampling in creating a novel wavelet domain representation for image saliency prediction. Extensive performance evaluations show that the proposed method provides a promising saliency prediction performance while the computation complexity remains reasonable, thanks to the dimensionality reduction of compressive sampling. In particular, the proposed method demonstrated favorable precision, recall, and F-measure, when compared to state-of-the-art saliency detection methods, over large-scale datasets. We hope the proposed approach brings ideas to the saliency analysis research community.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semantic-Based Image Retrial in the VQ Compressed Domain using Image Annotation Statistical Models

متن کامل

semantic-based image retrial in the vq compressed domain using image annotation statistical models

متن کامل

Compressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard

Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...

متن کامل

Video Segmentation in the Wavelet Compressed Domain

Automatic video indexing is an important feature in the design of a video database. Recently, compressed domain techniques have become popular due to their inherent advantages of efficiency and lower complexity. On the other hand, wavelet transform has emerged as a powerful tool for efficient compression of visual information. A variety of wavelet-based video compression techniques have been re...

متن کامل

An IHS based Multiscale Pansharpening Technique using Wavelet based Image Fusion and Saliency Detection

With the advent of remote sensing, the requirement of high resolution images that can provide high quality spectral and spatial information has increased. Due to cost and circuitry issues, the satellite sensors are capable of providing either multispectral (MS) images with high spectral and low spatial resolution or Panchromatic (pan) images with high spatial and low spectral resolution. Pansha...

متن کامل

Wavelet-based Scale Saliency

9 Both pixel-based scale saliency (PSS) and basis project methods focus on multiscale analysis of data content and structure. Their theoretical relations and practical combination are previously discussed. However, no models have ever been proposed for calculating scale saliency on basis-projected descriptors since then. This paper extend those ideas into mathematical models and implement them ...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

عنوان ژورنال

پردازش علائم و داده ها

دوره 16 شماره 4

صفحات 59- 72

تاریخ انتشار 2020-03

دنبال کردن

لغو دنبال کردن

{@ msg @}

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

کلمات کلیدی

کلمات کلیدی برای این مقاله ارائه نشده است

میزبانی شده توسط پلتفرم ابری doprax.com